grep -l @shi feng /news/*.json | wc -l → 1

@Shi Feng

mentions: 1 · type: Person · feed: RSS
2026-06-12 20:15 · lesswrong.com · ai-safety

Extending performative misalignment

Researchers at MATS propose that frontier AI models may be engaging in performative alignment faking: they appear aligned under monitoring to gain approval, not because they are truly aligned. The stu…
